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ABSTRACT 

Current and future large redshift surveys, as the Sloan Digital Sky Survey IV extended Baryon 
Oscillation Spectroscopic Survey (SDSS-IV/eBOSS) or the Dark Energy Spectroscopic Instru¬ 
ment (DESI), will use Emission-Line Galaxies (ELG) to probe cosmological models by mapping 
the large-scale structure of the Universe in the redshift range 0.6 < z < 1.7. With current data, 
we explore the halo-galaxy connection by measuring three clustering properties of ^-selected 
ELGs as matter tracers in the redshift range 0.6 < 2 ; < 1: (i) the redshift-space two-point cor¬ 
relation function using spectroscopic redshifts from the BOSS ELG sample and VIPERS; (ii) 
the angular two-point correlation function on the footprint of the CEHT-LS; (iii) the galaxy- 
galaxy leasing signal around the ELGs using the CEHTLenS. We interpret these observations 
by mapping them onto the latest high-resolution MultiDark Planck N-body simulation, using 
a novel (Sub)Halo-Abundance Matching technique that accounts for the ELG incompleteness. 
ELGs at z ^ 0.8 live in halos of (1 ± 0.5) x 10^^ h~^MQ and 22.5±2.5% of them are satel¬ 
lites belonging to a larger halo. The halo occupation distribution of ELGs indicates that we 
are sampling the galaxies in which stars form in the most efficient way, according to their 
stellar-to-halo mass ratio. 

Key words: galaxies: distances and redshifts — galaxies: haloes — galaxies: statistics 
— cosmology: observations — cosmology: theory — large-scale structure of Universe 


1 INTRODUCTION 

By investigating the properties of galaxy clustering within 
the cosmic web, it is possible to constrain cosmology and 
infer the growth of structure and the expansion history of 
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the Universe ([Weinberg et al.||2013D . In fact, galaxy cluster¬ 
ing measurements using last-generation large-volume red¬ 
shift surveys, as the Sloan Digital Sky Survey (SDSS; jYorkj 
jet al.j |2000| ; [Gunn et al.j [2006^ jSmee et al.j |2013|) and the 
SDSS-III Baryon Oscillation Spectroscopic Survey (BOSS; 
[Eisenstein et al.||2011t [Dawson et al.||20T^ provide robust 
information about both the evolution of galaxies and the 
cosmological framework in which these complex structures 
live. In order to interpret such measurements, we need to 
understand the relation between the theory-predicted dark 
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matter field and its luminous counterpart i.e., the discrete 
galaxy map (|Cooray Sheth||2002|) . 

Luminous low-redshift galaxies have already been con¬ 
nected to their dark matter halos in a precise manner, 
through weak leasing and clustering analysis as a function of 
galaxy luminosity and stellar mass. [Baldry et al.| (|2004|) , |Z^ 
|havi et aL| (|2011D and |Guo et aL| (|2015|) measured the cluster¬ 
ing properties of the SDSS “blue cloud” and “red sequence” 
in the local Universe (SDSS median redshift z ~ 0.1; |Abaza-| 
[jian et aL||2009|) , as a function of magnitude and color. Their 
results show that at a given luminosity, the blue sample has 
a lower clustering amplitude and a smaller correlation length 
compared to the red one. 

|Guo et aL| (|2014D investigated the clustering luminosity 
and colour dependence of BOSS CMASS DRIO (|Anderson| 
|et al.|[20T4|) , and found that more luminous galaxies are more 
clustered and hosted by more massive halos. For luminous 
red galaxies (LRGs), these masses are ~ 
at fixed luminosity, progressively redder galaxies are more 
strongly clustered on small scales, which can be explained by 
having a larger fraction of these galaxies in the form of satel¬ 
lites in massive haloes. [Favole et al.| (|2015|) measured galaxy 
clustering in the BOSS CMASS DRll ([Anderson et al.||2014|) 
sample ai z > 0.55 as a function of color, and proposed 
a new statistic to extract robust information about small- 
scale redshift-space distortions and large-scale galaxy bias. 
Consistent with many previous results (e.g., [Wang et al.[ 
[2007t [Zehavi et al.[ [2(305^ [Swanson et al.[ [2008[) , they found 
that, compared to the blue population, red galaxies reside 
in more massive halos, show a higher clustering amplitude, 
large-scale bias and peculiar velocities. 

This type of clustering analysis has recently been ex¬ 
tended to higher redshifts thanks to the VIMOS Public Ex- 
tragalactic Survey (VIPERS; [Guzzo et al.[[2014t [Garilli et al.[ 
[2014[) and DEEP2 survey ([Newman et al.[[2013[) . Compared 
to DEEP2, VIPERS has a much larger volume but has a 
lower redshift limit however, the signal-to-noise ratio in its 
spectroscopic measurements is higher. Using VIPERS data, 
[Marulli et al.[ ([2013[) measured the clustering properties of 
galaxies at redshift z = 0.8 as a function of their luminos¬ 
ity and stellar mass, and found that the clustering amplitude 
and the correlation length increase with these two quantities; 
see also the PRIsm MUlti-object Survey (PRIMUS) results 
by [Skibba et al.[ ([2015[) and [Bray et ^ ([2015[) . [Mostek et al.[ 
([2013[) measured the clustering of the red sequence and the 
blue cloud at z = 0.9, as a function of their stellar mass and 
star formation history, using DEEP2 data. They argued that 
blue galaxies are more clustered in the local Universe than 
at z = 0.9, and red galaxies are much more clustered locally 
than at high redshift. They also suggested that the clus¬ 
tering trend observed with star formation rate (SER) can 
be explained mostly by the correlation between stellar mass 
and clustering amplitude for blue galaxies. [Coil et aL| ([2008[) 
studied the DEEP2 clustering dependence on color and lu¬ 
minosity, and found that the dependence on color is much 
stronger than with luminosity, and is as strong with color at 
z ~ 1 as locally. They claimed no dependence of the clus¬ 
tering amplitude on color for galaxies in the red sequence, 
but a significant dependence for galaxies within the blue 
cloud. [Cooper et al.[ ([2008[) investigated the connection be¬ 
tween star formation (SE) and environment in DEEP2 data 
at z ~ 0.1, and z ~ 1. Their results indicate that, locally. 


galaxies in regions of higher over density have lower star for¬ 
mation rates (SERs), and their stars form more slowly than 
in their counterparts in lower density regions. At z ~ 1, 
this SER-overdensity relation is inverted; this is in part due 
to a population of bright, blue galaxies in dense environ¬ 
ments, which lacks a counterpart in the local Universe, and 
is thought to evolve into members of the red sequence from 
redshift 1 to 0. 

The combination of clustering with weak galaxy-galaxy 
leasing (see e.g., |Bartelmann[[1999[) allows one to gain insight 
on the large-scale structure formation, and directly probe 
the stellar-to-halo mass relation (SHMR; [Leauthaud et aL[ 
[2011[) . The galaxy-halo connection has been measured at 
z < 1 by [Leauthaud et ah] ([2012[) , [Shan et al.[ ([2015[) , and 
[Coupon et al.[ ([2015[) , using three different weak leasing sur¬ 
veys (COSMOS: [Scoville et al.[ ([2007[) ; CEHT-Stripe82 and 
CFHTLenSQ: iHeymans et al.| (|2Cll2aD ; |Erben et al.| (|2013aD ); 
all obtained consistent results. [Leauthaud et al.| (|2012|) per- 
formed the first joint analysis of galaxy-galaxy weak leasing, 
galaxy clustering, and galaxy number densities using COS¬ 
MOS data, and provided robust constraints on the shape and 
redshift evolution of the SHAM relation in the redshift range 
0.2 < < 1. At low stellar mass, the halo mass scales propor¬ 
tionally to this scaling does not evolve significantly 

with redshift. At M* > 5 x lO^^M©, the SHMR rises sharply, 
causing the stellar mass of a central galaxy to become a 
poor tracer of its parent halo mass. Combining observations 
in the CEHT-LenS/VIPERS field from the near-UV to the 
near-IR, [Coupon et al.[ ([2015[) found that the SHMR for the 
central galaxies peaks at Mu,peak — (l-9to.i and 

its amplitude decreases as the halo mass increases. [Hearin[ 
[et al.[ ([2014[) presented new measurements of the galaxy two- 
point correlation function and the galaxy-galaxy leasing sig¬ 
nal from SDSS, as a function of color and stellar mass, and 
demonstrated that the age-matching model ([Hearin fc Wat-[ 
which states that older halos tend to host galax¬ 
ies with older stellar populations, exhibits remarkable agree¬ 
ment with these and other statistics of low-redshift galaxies. 

Current (Sub)Halo-Abundance Matching (SHAM; [Con-[ 


[roy et umm id'ruiiiio-Comer^mipnTTt [Kiypin et aimm 

[Nuza et al.[[2013[) and Halo Occupation Distribution (HOD; 
[Berllnd V WemEi?g[2TO [Kravtsov et al.[[2Uni[; [Zheng eill] 
[2005[ , [2007[) models correctly reproduce the clustering mea¬ 
surement mentioned above. SHAM maps observed galaxies 
onto dark matter halos directly from N-body cosmological 
simulations, according to a precise monotonic correspon¬ 
dence between halo and galaxy number densities. The HOD 
method is an analytical prescription to populate simulated 
halos with galaxies, in which the assignment is perfomerd 
by interpolating the halo occupation distribution at the val¬ 
ues of the desired halo masses. In this sense, the SHAM 
approach returns a model which is built directly on the con¬ 
sidered simulation box. 

Next generation high-redshift surveys as SDSS- 
IV/eBOSS (Dawson et al. 2015, in prep.), Subaru Prime 
Eocus Spectrograph (PES; [Sugai et al.[ [2012t [Smee et al. 

, DESI (iSchleRel et al.||2015D , 4MOST]1 and Euclid^ 


^ http://www.cfht.hawaii.edu/Science/CFHTLS/ 
^ https://www.4most.eu/cms/ 

^ http://sci.esa.int/euclid/ 
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([Laureijs et al.|[2011t [Sartoris et al.|[20T5|) will use emission¬ 
line galaxies (ELGs) as BAO tracers to explore the Uni¬ 
verse large-scale structure out to z ~ 2. Observing ELGs, 
learning how to model their clustering properties and under¬ 
standing how they populate their host halos are therefore 
crucial points that we need to understand in order to select 
the targets for future experiments. Erom the observational 
point of view, the recent increment of available ELG spec¬ 
troscopic data (IGuzzo et al.| [2014^ [Gomparat et al.| |2015|) 
allows one to measure their clustering properties over about 
12 deg^ at z = 0.8 (corresponding to a comoving volume 
of U ~ 10.6 X 10® /i“^Mpc^ in the Planck cosmology; see 
Section ^ for details), which represents a dramatic improve¬ 
ment. 

[Gomparat et al.| (|2013|) demonstrated that neither a 
standard HOD nor a traditional SHAM technique are able 
to reproduce the angular clustering of ELGs on small scales. 
In fact, both techniques are based on the assumption that 
the galaxy sample to model is complete, but this is not the 
case of the ELGs, which are highly incomplete in stellar 
mass. One could instead use semi-analytic models of galaxy 
formation and hydrodynamic simulations, but they lack of 
mass resolutions to model emission line galaxies. 

The aim of this work is to provide a modified version 
of the standard SHAM prescription, directly based on the 
latest MultiDark N-body simulation with Planck cosmol¬ 
ogy, that accounts for the ELG incompleteness and returns 
suitable mock galaxy catalogs able to accurately predict the 
ELG angular and redshift-space clustering, respectively, on 
small and larger scales. These mock catalogs are released to 
the public. 

The paper is organized as follows. Section [^ describes 
the data sets and the MultiDark simulation box used in our 
analysis. In Section [^ we present our ELG clustering and 
weak leasing measurements. In Section^ we explain how we 
model the ELG clustering and we present our main results. 
Section [^ discusses the implications of our ELG clustering 
analysis in a galaxy evolution perspective, and Section [^ 
summarizes our main results. 

Throughout the paper, we assume the Planck cosmol¬ 
ogy ([Planck Gollaboration et al.||2014|) and magnitudes in 
the AB system ([Qke Gunn[ [l983[) . 


2 DATA AND SIMULATION 
2.1 Data sets 

We build our ELG galaxy sample using the Ganada-Erance- 
Hawaii Telescope Legacy Survey (GEHT-LS) Wide TOOOT^ 
photometric redshift catalog ([Ilbert et al.[ [2006^ [Goupon 
[et al.[[2009[) . We apply a ^-band magnitude cut, 20 < ^ < 
22.8 ([Eukugita et al.[[1996[) , to select galaxies with bright 
emission lines and low dust aX z < 1. We also apply a color 
selection, —0.5 < {u — r) < 0.7 {g — i) 0.1, to remove the 
low-redshift galaxies. Eor details on the selection function, 
see [Gomparat et al] ([2015[) . Then, to obtain the largest pos¬ 
sible area, we convert the z-selection into the new Megacam 


^ http://www.cfht.hawaii.edu/Science/CFHTLS/ 


z-band filter^. Eor the Wl, W3 and W40 fields, we derive an 
average density of about 500 ELGs per deg^, 70% of which 
have a photometric redshift in the range 0.6 < z < 1. The 
densities of each field are reported in Table |T|, and the errors 
on the photometric redshift are cr^ < 0.05 (1 + ^) for i < 22.5 
and z < 1. The ugri ELG selection is brighter than i < 22.5. 

We match the photometric targets to the available spec¬ 
troscopic surveys - BOSS DR12, DEEP2, VIPERS ([Bolton[ 
lAlam et al.[PUT3[; [N ewman et 
[et al.[[20l4[) - within 1” radius; see Table Based on KS- 
tests, the VIPERS, BOSS and DEEP2 spectroscopic selec¬ 
tions constitute fair sub-samples of the complete selection: 
the hypothesis that they are drawn from the same distribu¬ 
tion cannot be rejected at the 90% confidence level. Eor these 
samples, we create random catalogs with the same redshift 
distribution of the data and 30 times denser. Eigure dis¬ 
plays the ELG spectroscopic redshift distribution per unit 
volume for the three Wide fields (dashed, dotted, dot-dashed 
histograms), and their mean (solid line). Two thirds of the 
galaxy density is located in the redshift range 0.7 < z < 0.9, 
while both the intervals 0.6 < z < 0.7 and 0.9 < z < I con¬ 
tain one sixth of the sample. According to the ELG selection 
function in [Gomparat et al.[ ([20I5[) , we select only galaxies 
at z > 0.6 since we are not interested in low-redshift objects. 
We have investigated further the impact of the higher red¬ 
shift cut, z < I, on the angular clustering by imposing to the 
ELG sample different redshift thresholds: z < 1,1.2,1.4,1.6. 
In all these samples the lower redshift cut is fixed at > 0.6 
and we have imposed the i < 22.5 magnitude cut to elimi¬ 
nate bad photometric redshifts. We find that including also 
ELGs at z ^ I, we are slightly enhancing the galaxy num- 


^ http://www4.cade-ccda.hia-iha.nrc-enre.gc.ca/en/ 
megapipe/docs/filt.html 

http: //www. cf ht. hawaii. edu/Science/CFHLS/T0007/ 
T0007-docsul0.html 
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Figure 2. ELG weighted spectroscopic redshift distribution per 
unit volume for the WI, W3, and W4 Wide fields (dashed and 
dotted histograms), and their mean value (solid line). 
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Table 1. ELG photometric data per CFHT-LS Wide field after applying the bright star and bad field mask. 


Field 

Genter o, 5 
area [deg^] 

W1 

35°,-7° 

63.75 

W3 

215°, 54° 

44.22 

W4 

333°, 2° 

23.3 

all 

131.27 

N 

32,808 

22,195 

11,025 

66,028 

N [deg“2] 

514.64 

501.92 

473.18 

502.99 

Zphot quartiles 

0.78 / 0.88 / 1.03 

0.77 / 0.88 / 1.05 

0.78 / 0.88 / 1.03 

0.78 / 0.88 / 1.03 

Zphot Deciles DIO, D90 

0.7 / 1.24 

0.68 / 1.29 

0.69 / 1.25 

0.69 / 1.26 

N (0.6 < Zj)hot < 1) 

23,433 

15,242 

7,861 

46,536 

N (0.6 < Zyhot < 1) [deg-2] 

367.58 

344.69 

337.38 

354.51 

Zphot quartiles 

0.75 / 0.83 / 0.9 

0.74 / 0.81 / 0.89 

0.75 / 0.83 / 0.89 

0.75 / 0.82 / 0.89 



RA [deg] RA [deg] RA [deg] 


Figure 1. Photometric (black points) and spectroscopic (VIPERS: red crosses in the right and left panels; BOSS: magenta crosses 
forming the oval in the central panel; DEEP2: blue crosses in the dashed box in the central panel) coordinates of our ELG sample in the 
three GFHT-LS Wide fields. 


ber density of our sample and consequently suppressing the 
amplitude of w{0)^ but we do no see any substantial change 
in the angular clustering trend with respect to the z < 1 
case. We therefore restrict the analysis to the redshift range 
0.6< z < 1. 


2.2 MultiDark simulations 


and [Kitzbichler fc White] (|2007|) , and can be summarized as 
follows: 

(i) Set the properties of the light-cone: angular mask, 
radial selection function (number density) and number of 
snapshots within the redshift range considered. Each slice 
of the light-cone is constructed by selecting all halos from 
every MDPL snapshot. The thickness of a slice at redshift 
is given by [(zi Zi-i)l2, (zi -k Zi+i)/2] 


The MultiDark Planck simulation (MDPL, [Klypin et aL| 
[2014^ www.MultiDark.org) contains 3840^ particles in a 
1 h~^Gpc box, and was created adopting Planck ACDM cos¬ 
mology ([Planck Collaboration et al.||2014|) . Halos are iden¬ 
tified based on density peaks including substructures using 
the Bound Density Maximum (BDM) halo finder ([Klypin fcj 
[Holtzman|[1997t [Riebe et aL[ [20131) . 

We use the MDPL halo catalogs to build a mock 
light-cone that matches the mean ELG redshift distribution 
shown in Eigure@. Given the high density of the ELG tracers 
and their expected low-mass host halos, the MDPL box is 
an excellent compromise between numerical resolution and 
volume. We apply the SUrvey GenerAtoR code (SUGAR; 
Rodriguez-Torres et al. (2015), in prep.) to the 11 snap¬ 
shots available from MDPL to construct a light-cone with a 
volume ten times the observations that covers the redshift 
range 0.6< z <1 lh~^ Gpc depth). The procedure used 
is analogous to the method presented by [Blaizot et aL[ ([200 5[) 


(ii) Place an observer (i.e., z = 0) inside the box and 
shift the cartesian coordinates of the box in such a way 
that the observer occupies the central point of the box at 
= 0.8 


(iii) Gonvert from cartesian (x, y, z) to spherical (a, S, Tc) 
coordinates, where Vc is the comoving distance in real space. 
The redshift of each point will be: 



cdz 

Hq y/Um(l + z'^^ -f Qa 


( 1 ) 


(iv) Erom each snapshot, select the (sub)halos so that 
{zi + Zi-i)l2 < z < {zi Zi+i)/2 and a/6 he inside the 
sky window. Since the ELG observational data represent 
halos with typical masses ~ lO^^/i^^M©, in the light-cone 
we include all halos for which the simulation is complete 
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Table 2. ELG spectroscopic data. 


survey 

match 

good 2 ; 

0.6 < 2 ; < 1 

area [deg^] 

z 

VIPERS W1 

1,223 

942 

760 

5.478 

0.803 

BOSS W3 

2,145 

1,876 

1,357 

6.67 

0.803 

DEEP2 W3 

225 

222 

156 

0.5 

0.803 

VIPERS W4 

1,148 

846 

680 

5.120 

0.795 

All 

4,741 

3,886 

2,953 

17.668 

0.803 


i.e., logiMH/h-^Me) > 11.2 

(v) Using the halo velocities, Vp, we compute the peculiar 
velocity contribution for each object along the line-of-sight 
and derive its distance in redshift-space as 

s = rc-\-{vp- rc)/{aH{z)), (2) 

where a = (1 + is the scale factor and H{z) is the 
Hubble parameter at redshift z 

(vi) Finally, select objects from the light-cone using our 
selection function. 

Throughout the paper we will designate our lightcone as 
“MDPL-LC”. Section 0| describes in detail the halo selection 
and the (Sub)Halo-Abundance Matching modeling adopted 
to determine the halo occupation distribution of our ELG 
sample. 


3 MEASUREMENTS 

Using the ELG sample described in Section 113 we measure 
both galaxy clustering and galaxy-galaxy leasing. The fol¬ 
lowing provides a detailed description of our measurements. 

3.1 Galaxy Clustering 

We estimate both the angular, w(0), and the redshift-space, 
^(s) (hereafter two-point correlation functions following 
the procedures described by |Landy Szalay| (|1993|) , |Goupon| 
|et al.| (|2012|) and |de la Torre et ah] (|2013|) . 

To compute on the VIPERS and the BOSS ELG 
samples (see Table , independently, we create linear bins in 
separations of l/i“^Mpc at s < 10/i“^Mpc, and 4/i“^Mpc 
for 10 < s < 40/i“^Mpc. We then correct the impact of 
redshift errors and catastrophic redshifts to recover the cor¬ 
relation function down to l/i“^Mpc. The ELG we are tar¬ 
geting are observed using three plates overlapping the same 
area of the sky. This configuration guarantees that all the 
targets are observed at the end of the process and there 
is no fiber collision ([Blanton et al.|[2003t [Reid et ^[20141) . 
We correct for the finite size of VIPERS following jde la[ 
[Torre et al.j ([2013[) , and define the survey completeness in 
terms of target sampling rate (TSR) and the spectroscopic 
success rates (SSR). The first quantity weights our ability 
of obtaining spectra from the potential targets meeting the 
survey selection in the parent photometric sample. Eor each 
galaxy in the spectroscopic catalog, we count the number of 
objects that lie within a given radius in the spectroscopic 
{^targeted) and in thc photometric (Nparent) sample. The 




Figure 3. Two-point angular (top panel) and redshift-space (bot¬ 
tom panel) ELG correlation functions (points), together with our 
best-fit model (blue line), which corresponds to the point high¬ 
lighted by a star in Figure [^ 


TSR is then given by the ratio TSR = Ntargeted I Nparent • 
The SSR represents our ability of determining galaxy red- 
shifts from observed spectra. To compute it, we replace 
the bad redshifts in the spectroscopic catalog with good 
photometric redshifts. Galaxies with good spectra are as¬ 
signed flag=0 (Ngood)] galaxies with replaced redshifts are 
assigned flag=l (Ntargeted)- The SSR is then computed as 
the ratio SSR = Ngood!Ntargeted- The contribution of each 
galaxy in the clustering pair counts is then weighted by 
w — XjiTSR * SSR). Finally, we combine the VIPERS and 
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BOSS measurements weighted by the projected density of 
each field. The resulting redshift-space correlation function 
is displayed in the bottom panel of Figure ^ by fitting a 
power-law model, ^[s) — (s/so)^, in the separation range 
2 < s < 30/i“^Mpc, we find so = (5.3 ± 0.2) /i“^Mpc and 
a = -1.6±0.1. 

Analogously, we calculate the angular 2PCF, w(0)^ us¬ 
ing photometric redshifts from the Wl, W3 and W4 CFHT- 
LS fields. The points are corrected from the integral con¬ 
straint following [Tinker et al.| (|2010|) and [Coupon et al.j 
([2012[) to account for the restricted area of observation. On 
scales 0 < 0.05°, all three fields provide consistent mea¬ 
surements. At larger scales, the clustering signals in the 
Wl and W4 fields do not decrease as rapidly as expected, 
probably pointing to possible systematics that need to be 
investigated further. We therefore use only the measure¬ 
ment on the W3 field, which appears the most robust (see 
Figure ^ top panel). The w{0) of the W3 field is in per¬ 
fect agreement with Figure 9 (panel 4) in [Comparat et al.[ 
(|20T1 . This result was computed on the Stripe 82 region 
([Stoughton et al.][2002[) , with three times larger area. At 
the mean redshift of the sample, z = 0.8, one degree corre¬ 
sponds to 18.847/i“^Mpc; thus, w{0) spans the range from 
~ 40 h~^kpc up to ~ 20 /i“^Mpc. We investigate further the 
impact on the clustering amplitude of including the tails of 
the photometric distribution, i.e. Zphot <0.6 and Zphot > 1- 
This inclusion does not produce any substantial change in 
w{0), except for some additional noise. We also test how pho¬ 
tometric uncertainties affect the clustering errors computed 
via mock resampling. To this purpose, following [Coupon[ 
[et al.[ ([2012[) , we perturb our original redshift distribution by 
applying a photometric scatter with mean cr^ = 0.035(1+ z). 
We then quantify the number of photometric objects that, 
due to this scatter, enter the ELG selection in the range 
0.6 < z < 1 from the lower and higher tails of the distri¬ 
bution, and the objects that exit. We find that only 2.5% 
photometric redshifts enter the ELG selection in the range 
0.6 < z < 1 from the upper and lower tails, and their effect 
on the clustering is negligible. 

To estimate the errors on our galaxy clustering mea¬ 
surements, since the simulated light-cone area is larger than 
the data (~560 deg^), we divide the best MDPL-LG model 
into independent (i.e., non-overlapping) realizations of our 
ELG data (8 for the photometric and 24 for the spectro¬ 
scopic samples), and obtain sample variance diagonal er¬ 
rors that we use rather than Poisson errors. Including the 
photometric uncertainties in our jackknife resamplings does 
not provoke any significant change in the error estimates. 
We neglect a full-covariance analysis because the number 
of sub-samples we have is too small to produce reliable co- 
variance estimates. Including also the off-diagonal elements 
of the covariance matrices would result in large fluctuations 
of the error bars. Of course, excluding covariances we are 
adopting a simplified approach, but it provides a good sense 
of how the SDSS BOSS ELG clustering behaves. On the 
other hand, the ELG sample considered here is too sparse 
to derive tight constraints from our clustering analysis. New- 
generation large-volume spectroscopic surveys as eBOSS, 
DESI and 4MOST, will provide new data with unprece¬ 
dented statistics, sky coverage and imaging quality. Using 
those data, a fully covariant approach will return reliable 
and accurate error estimates. 


We compare the combined measurement from BOSS 
and VIPERS to previous measurements by [IVlarulli et al.[ 
([2013[) to provide a first interpretation. Our result matches 
both the clustering signal of galaxies selected in the stellar 
mass range 9.5 < log (Mhc/Zi^^M©) < 11, and the cluster¬ 
ing of galaxies selected by absolute magnitude in the in¬ 
terval —22 < Mb — 51og(/i) < —20.5. Using the stellar-to- 
halo mass relation from Leanthaud et al.| ([2012[) , [Shan et 
([2015[) and [Goupon et al.[ ([2015[) , we can deduce a rough es¬ 
timate of the halo masses populated by our ELG sample i.e., 
11.6 < log (M/j//i“^Mo) < 12.7. These halo masses are typ¬ 
ical of Milky-Way size halos, being much less massive than 
those hosting the LRG sample, see [Nuza et"aL| ([2013[) . 

In the angular clustering measurement, the change of 
slope occurs at ^ ~ 0.01°, corresponding to ~ 200 h~^kpc. 
Using MDPL, we derive the relation between halo mass and 
virial radius at ~ 0.8; halos with virial radius ~ 200/i“^kpc 
occupy the mass range Mh = (0.5 — 1) x lO^^/i^^M©. Since 
a single galaxy per halo would not induce such a change in 
the w{0) slope, this result implies a satellite fraction of ap¬ 
proximately 22.5% (see Section^. Eigure ^ displays a good 
agreement between our clustering measurements and predic¬ 
tions for ELG halos of mass 10 ^^/i“^Mq with this satellite 
fraction. 


3.2 Weak leasing 

We use the latest weak leasing catalogs produced 
by the Ganada-Erance-Hawaii Telescope Leasing Survey 
(GEHTLenS; [Heymans et al.[[2012bt [Erben et al.[[2013b|) on 
the Wl and W3 fields to measure the galaxy-galaxy leasing 
around 47,485 ELG lenses. This measurement allows one to 
constrain the halo masses. We follow [Gillis etaTI ([2013[) and 
apply only the multiplicative correction, rus, to the shear 
measurement and avoid the c2 correction. We measure the 
tangential shear, 7^, around the photometric ELG sample 
as a function of the radial distance from the lenses using the 
([Velander et al.[[2011[) estimator: 


'Eis Wls^ls^c' 

/ 

±^s)' 


/ 

T,is 


where the sum runs over the lens - source pairs (Is) and the 
wis values are the weight obtained by lensfit. 

Since the lenses are at the higher tail of the redshift 
distribution and the ELGs are expected to live in low-mass 
halos, we recover a low signal-to-noise ratio around 2 for 
R < 1 Mpc. 

We model the measurement using a truncated Navarro, 
Erank V White (NEW) halo profile ([Baltz et al.[[2009[) and 
the mass-concentration relation from [Neto et aTI ([2007[) to 
truncate halos at half their concentration ([Wright V Brain^ 
|e?d[[2000D . The best-fit model suggests typical halo masses of 
M200 = 1.25±0.45 X 10^^h~^ Mq. The lower and upper mass 
limits are, respectively, M200 = 5.61 ± 7.20 x 10^^h~^M q 
and M200 = 1.41 ±0.51 x 10^^h~^M q] see Eig. ^ This mea¬ 
surement is in good agreement with the first interpretations 
based on the clustering (see Section . 
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Figure 4. ELG surface density (AE) as a function of the physical 
scale for different lens models. 


4 HALO OCCUPATION FOR EMISSION LINE 
GALAXIES 

The (Sub)Halo-Abundance Matching (SHAM; e.g., |Con-| 
|roy et all [2006^ [Trujillo- Gomez et al.| [20111) technique is a 
straightforward method to link observed galaxies with dark- 
matter-only simulated halos. It relies in a monotonic corre¬ 
spondence between halo and galaxy number densities, which 
is based on the assumption that more luminous galaxies re¬ 
side in more massive halos. Such association is performed 
by choosing suitable proxies for both halos and galaxies 
(e.g., the halo maximum circular velocity and the galaxy 
luminosity or stellar mass) and includes some scatter (see 
[Trujillo- Gomez et al.[ ([2011[) for details). The advantage of 
using iV-body simulations, compared to analytical models, 
is given by the accuracy achieved in the predictions of the 
clustering for a given halo population. Many state-of-the art 
clustering measurements have been modeled using a SHAM 
technique that maps the observations onto suitable high- 
resolution N-body simulations, allowing the interpretation 
of the halo occupation distribution and bias ([de la Torre[ 
[et al.[ [2013t [Nuza et al.[ [2013^ [Carretero et al.[[2015[) . [Wat-[ 
[son et aL| ([2015[) recently presented a method to upgrade 
SHAM models to account for differences between quenched 
and star-forming galaxies. 

In the specific case of the emission-line galaxies, the 
traditional SHAM approach cannot be applied since it re¬ 
quires a complete galaxy sample, and ELGs are far from 
being complete in any parameter space, even in terms of 
their emission line luminosity, see [Comparat et ah] ([2013[) . 
We therefore must modify the standard SHAM procedure 
to take into account the ELG incompleteness and match 
their clustering amplitude. To this purpose, we selected ha¬ 
los and subhalos by mass (for the subhalos we considered 
only the mass of the bound particles, to avoid ambiguities) 
to be able to compare directly with the weak leasing mea¬ 
surements. In the future, provided a high signal-to-noise ra¬ 
tio in the clustering measurement, we will properly select 


(sub)halos by their maximum circular velocity at accretion, 
(e.g., [Behroozi et al.[|^I3b[) . 

In order to model both the I-halo and the 2-halo 
terms in the ELG two-point correlation functions and the 
weak leasing measurement, we use the MultiDark Planck 
I/i“^Gpc^ box (see Section [2^ , which represents the best 
compromise between high resolution and volume, as previ¬ 
ously described in Section [^ 

We parametrize the probability of selecting a halo host¬ 
ing an ELG as follows: 

^mean-i G'M -j fsat) — 

fsat-^^^meani ^M — Sat)T (4) 

(I fsat)-^ (Af/j,, M^rneari') CFM , flag — CCn) 

where A/" is a Gaussian distribution with the variable being 
Mh^ the halo mass. The parameters are: Mmean, the mean 
halo mass of the sample including both host and satellite 
halos; ctm, the dispersion around the mean halo mass; fsat, 
the satellite fraction. The additional parameter “flag” en¬ 
ables to identify among the halos the ones that are centrals 
(flag=cen) or the ones that are satellites (flag=sat). 

To qualitatively understand the dependence of cluster¬ 
ing on Mmean and fsat, wc imposc (i) a maximum halo 
mass threshold to the MDPL-LC by removing all halos with 
Mh > Mmax and we apply the standard SHAM proce¬ 
dure. The higher-mass (Mmax > lO^^/i^^M©) models re¬ 
produce well the observed w{0), and that the lower-mass 
models (Mmax < lO^^/i^^M©) match the large-scale clus¬ 
tering, but not the small-scale amplitude witnessed below 
0 ~ 0.01°. The top row in Eigure |5| displays the ratio be¬ 
tween the angular (left panel) and the monopole (right) cor¬ 
relation functions of the lower-mass models and the model 
with Mmean — lO^^/i^^M©. Wc scc a mild variation in w{d) 
as a function of the physical scale, and a flatter trend in the 
monopole. 

We next (ii) fix the halo mass by selecting all the halos 
in the mass bin Mh = (I ± 0.5 x 10^^ /i^^M©), and vary the 
satellite fraction. We split this halo catalog into two cata¬ 
logs, one containing only central halos {fsat = 0) and one 
with satellites; then downsample both mocks to match the 
ELG n{z). The bottom panels in Eigure [^present the varia¬ 
tion of the angular and monopole clustering as a function of 
the scale. At small scales the amplitude of w{0) with more 
than 30% satellite fraction is strongly enhanced compared to 
the 10 — 20% cases. In the monopole there is almost no vari¬ 
ation with the scale. We then combine these two products 
to build galaxy mock catalogs that contain a fsat fraction of 
satellites (taken from the satellite-only mock) and {I-fsat) 
centrals (from the central-only mock). Satellite fractions be¬ 
tween 20% and 30% account for the clustering signal on both 
small and large scales; see Eigure^ All the selections above 
are done on the halo mass defined as M 200 , which correspond 
to an overdensity threshold of A 200 = 200pc ([Prada et al.[ 
[20I2[) , where pc is the critical density of the Universe. 

To produce a mock catalog, we randomly select halos 
from the light-cone according to the probability distribution 
P, defined in Eq. ^ until the ELG redshift distribution n{z) 
in Eigure [^ is achieved. We then construct a grid of mocks 
by selecting Mmean in the range 10^^'^ — 10^^'^ gm 

between the values Mmean/[I-, 2., 4.] (the sampling 

space is three times larger), and the satellite fraction in the 
interval 0 < fsat < 0.5, to obtain predictions for both and 
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Figure 5. Left column, top panel: ratio of the angular correlation functions of the MDPL-LC halos selected by mass, to w(6) computed 
at Mmean = The curves in the plot go from lower mass (bottom line) to higher mass (top line). Left column, bottom panel: 

ratio of the angular correlation functions of the MDPL-LC halos with varying satellite fraction, to w{6) computed at fsat = 0. The lines 
in the plot go from lower fsat (bottom line) to higher fsat (top line). Right column: same results for the monopole. The top row presents 
our first experiment (see the text for details) on the lightcone: we impose different halo mass thresholds to the MDPL-LC and apply a 
standard SHAM. The bottom row displays SHAM in the mass bin = (1 ± 0.5 x 10^^ h^^M©) with varying satellite fractions. 


w{0). Finally, we compare these model predictions with our 
measurements by computing a combined scales 2 < 

s < 22/i“^Mpc for the monopole, and 0.002° < 0 < 0.55° 
for the angular clustering, as follows: 

2 _ s 

^ “ Ni + N^ ’ ^ ’ 

where 


2 

X.w{0) 


1 \Wobserved{^i^ Whalos{^i^\ 

^ ifdJobservedi^^iy) 


and 


2 


A 


\^observed^Si) ^halos 
{^observed (Si)) 


( 6 ) 


(7) 


are degenerate with respect to the mean halo mass and the 
satellite fraction. In fact. Figure shows that a plethora of 
(log Mmean,/sat) models fit the data: from (11.3,0.45) by 
(12,0.2) to (12.5,0). Given the 41 degrees of freedom we 
have, we consider acceptable those models with < 1.25. 
Models with a higher value are rejected at the 90% level. 

The combination with the weak leasing results breaks 
this degeneracy and rules out the higher- and lower-mass 
models. However, among these latter, there is one with 
= 1 and parameters: log Mmean = 12 , ctm = Mmean/2, 
fsat = 22.5% (star symbol in Figure |^. The angular and 
redshift-space correlation functions of this best-fit mock are 
displayed in Figure ^ (blue line), together with the ELG 
measurements. The weak leasing measurement are perfectly 
compatible with this best-fit model. 


The possible models accounting for the ELG clustering 


We provide our best-fit MDPL mock catalog to the 
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Figure 6. The two parameters driving the model: fraction of 
satellite (fsat) and mean halo mass (Mmean)- The spread around 
the mean halo mass is fixed at the value cfm = Mmean The ver¬ 
tical black lines represent the constraints by weak leasing (dashed: 
lower and upper limits; solid: mean), which rule out the major¬ 
ity of the low-mass and high-mass models. Our best-fit model is 
highlighted by the star symbol. 


ELG clustering measurements at http://projects.ift. 
uam-csic.es/skies-universes/. 


5 RESULTS AND DISCUSSION 

5.1 ELG clustering trends as a function of 

magnitude, flux, luminosity and stellar mass 

We employ the complete VIPERS data sample at z ~ 0.8, 
which has about 30,000 reliable redshifts in the range 
0.6 < z < 1, to investigate trends of the clustering am¬ 
plitude (bias) with observed or rest frame broad band mag¬ 
nitude or emission line flux. To this purpose, we measure 
the emission line properties in the VIPERS spectra and find 
a significant [On] flux in about two thirds of them; the rest 
does not show emission lines (Comparat et ah, in prep.). 
We bin the data according to apparent and absolute mag¬ 
nitude, [On] flux and luminosy, and measure the clustering 
in each sample (the binning scheme was set to contain be¬ 
tween 9000 and 10,000 data points). Eigure shows our 
ELG results in the observed (bottom row) and rest frame 
(top row). Gonsistently with previous analyses (e.g., |Marulli| 
|et al.||20f3t [Mostek et al.||2013|) , we find that the brighter the 
selection in the z-band, either observed or rest-frame, the 
higher the bias. Analogously, the fainter the p-band limit, 
either observed or rest-frame, the higher the bias. The anti¬ 
correlation between [On] flux and bias is only seen in the 
observed frame (the difference is ~ 1.4); in the rest frame 
it is not significant. It would be interesting to further inves¬ 
tigate the correlation between [On] luminosity and ^-band 
magnitude in the small-scale clustering, but with the reso¬ 
lution of current data we are not able to push the analysis 
to scales ~ 200/i“^kpc, which is the typical virial radius of 
a halo of mass New data from eBOSS will be 

able to address this issue. The results above indicate that 


if we have a ^-selected ELG sample and [On] fluxes for a 
certain number of its galaxies, in order to maximize its clus¬ 
tering signal, we should select the ELGs with brighter i- 
band magnitudes. To investigate the clustering dependence 
on stellar mass, we map the host halo masses for ELGs at 
z ~ 0.8, Mh ~ lO^^/i“^M0, onto stellar mass values using 
the stellar-to-halo-mass relation by [Leant hand et al.| (|2012|) , 
see their Eigure 11. Our data are right before the “knee” at 
M* - 3.5 X 10^°/i“^Mo. 


5.2 Star formation efficiency 

Erom our analysis, the typical halo masses hosting ELGs at 
z - 0.8 are Mh - (l=b0.5) x 10^^/i“^Mo, and 22.5%±2.5% 
of them are satellites belonging to a larger halo, whose cen¬ 
tral is a quiescent galaxy. Eigure jSj provides a schematic 
representation of the possible ELG configurations. A total 
of 22.1% ELGs are single satellites belonging to a parent 
halo with mass MhQ ~ 2.5 x 10^^ only in 1.3% of the 

cases the parent halo hosts more than one satellite ELG. The 
maximum number of satellites, n = 1.8, is achieved in the 
highest-mass case, where Mhq ~ 6.8 x These 

results imply that the mean number of ELG satellites is only 
slighlty larger than unity (~ 1.01). The quiescent galaxies 
at the center of the parent halos are not included in the 
sample, since the stellar masses for ELGs from the SHMR 
discussed above are too low for halos of 10^^ h~^MQ. 

The typical masses for halos hosting ELGs suggest that 
we are sampling halos (~ 10^^ that form stars in 

the most efficient way, according to the stellar-to-halo mass 
ratio discussed by [Behroozi et al.j (|2013a|) (see their Eigure 
1, bottom panel). This result opens a new science field and, 
hopefully, in the near future, integrated models combining 
N-body simulations with semi-analytic models (SAMs) will 
be able to probe star formation and shed some light on the 
correlations between [On] flux and magnitude in the cluster¬ 
ing of galaxies. 


6 SUMMARY 

We have presented an analysis of the halo occupation distri¬ 
bution for emission line galaxies, which jointly accounts for 
three measurements: the angular correlation function, the 
monopole, and the weak leasing signal around ELGs (see 
Section . Our procedure can be summarized in the follow¬ 
ing points: 

• Apply the SUGAR (Rodriguez-Torres et al. (2015), 
in prep.) algorithm to the 11 snapshots available from the 
MDPL simulation to construct a light-cone (Section |2.2|) , 
with the same geometry and angular footprint of the ELG 
data. 

• Modify the traditional SHAM technique (Section 

to account for the ELG incompleteness, by selecting model 
galaxies by mass, until we match the observed ELG n(z). 
In this way, our mock is constrained by the observed ELG 
redshift distribution, and represents a reliable model. 

• Parametrize the probability of selecting a halo hosting 
a ELG with Eq. ^ in terms of the mean halo mass of the 
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Figure 7. VIPERS clustering trends as a function of the band and z—band magnitudes (top row: rest frame; bottom row: observed 
frame), [On] luminosity (top row) and [On] flux (bottom row). 
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Figure 8. Schematic diagram of possible ELG configurations. ELGs at 2 ; ~ 0.8 typically live in halos of mass ~ (lzbO.5) x 10^^ 
and 22.5% are satellites belonging to larger halos, whose central galaxy is quiescent. Among these satellite conflgurations, 21.2% of parent 
halos with M^q ~ 2.5 x 10^^ host one satellite ELG, and only 1.3% of parents host more than one satellite ELG. The maxium 

number of satellites, n = 1.8, is achieved in the highest-mass case, M^q ~ 6.8 X 10^^ See the text for details. 
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sample {Mmean)^ the dispersion around the mean (ctm), 
and the satellite fraction (fsat)- The additional parameter 
“flag” enables to distinguish central and satellite halos. 

• We perform two experiments (see Section ^ on the 
MDPL light-cone to derive information on which are the 
halo mass and satellite fraction ranges of values we need 
to input in our modified SHAM model to correctly fit the 
ELG clustering signal. 

• Construct a grid of models based on these values, and 
jointly fit both angular and redshift-space clustering (see 
Section Our best-fit models (see Figure are degener¬ 
ate with respect to Mmean and fsat- The combination with 
the weak leasing analysis (see Section 3.1) breaks this degen¬ 
eracy and rules out the highest and lowest mass models. Our 
best-fit (x^ = 1) model is shown in Figure |3| together with 
the ELG measurements, and is given by log M^ean = 12, 
fsat — 22.5%, <JM — ^mean 

To conclude, we have built and released to the com¬ 
munity a reliable galaxy mock catalog that correctly fits 
the clustering amplitude of the ugri ELG sample con¬ 
structed by matching spectroscopic redshifts from BOSS 
DR12, VIPERS and DEEP2 (for details see Section H). With 
these tools, we can begin building many realizations of the 
density field to predict errors on the BAO measurement. 

The measured halo masses for halos hosting emission¬ 
line galaxies indicate that we are sampling the halos that 
form stars in the most efficient way, according the stellar- 
to-halo mass ratio discussed by [Behroozi et al.| (|2013a]) (see 
their Figure 1, bottom panel). This is an important point for 
the future, and opens the path to further studies to under¬ 
stand the correlation between clustering and the strength 
of emission lines. With the resolution available from cur¬ 
rent data, we are not able to push the analysis to the typi¬ 
cal scales (~ 200/i“^kpc) of halos of 10^^ however, 

next-generation surveys, as eBOSS and DESI, will provide 
better resolution, and in the near future we should be able to 
build robust combinations of N-body simulations and SAMs 
that will address those questions. 
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